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(54) Title: NUCLEIC ACID INTEGRATION IN EUKARYOTES 

(57) Abstract: The invention relate to the field of molecular biology and cell biology. It particularly relates to methods to direct 
integration of a nucleic acid of interest towards homologous recombination and uses thereof. The present invention discloses factors 
involved in integration of a nucleic acid by illegitimate recombination which provides a method to direct integration of a nucleic 
acid of interest to a pre-determined site, whereby said nucleic acid has homology at or around the said pre- determined site, in a 
eukaryote with a preference for non-homologous recombination comprising steering an integration pathway towards homologous 
recombination. Furthermore, the invention provides a method to direct integration of a nucleic acid of interest to a sub-telomeric 
and/or telomeric region in a eukaryote with a preference for non -homologous recombination. 
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Title: Nucleic acid integration in eukaryotes 

The invention relates to the field of molecular biology and ceU biology. It 
particularly relates to methods to direct integration towards homologous 
5 recombination and uses thereof. Several methods are known to transfer nucleic 
acids to, in particular, eukaiyotic cells. In some methods the nucleic acid of 
interest is transferred to the cytoplasm of the cell, in some the nucleic acid of 
interest is integrated into the genome of the host. Many different vehicles for 
transfer of the nucleic acid are known. For different kinds of cells, different 
10 systems can be used, although many systems are more widely applicable than 
just a certEiin kind of cells. In plants, e.g., a system based on Agrobacterium 
tuinefaciens is often applied. This system is one of the systems that is used in a 
method according to the invention. 

The soil bacterium Agrobacterium tumefaciens is able to transfer part of 
15 its tumor-inducing (Ti) plasmid, the transferred (T-) DNA, to plant cells. This 
resxilts in crown gaU tumor formation on plants due to expression of oTic-genes, 
which are present on the T-DNA. Virulence {vir) genes, located elsewhere on 
the Ti-plasmid, mediate T-DNA transfer to the plant cell. Some Vir proteins 
accompany the T-DNA during its transfer to the plant cell to protect the T- 
20 DNA and to mediate its transfer to the plant nucleus. Once in the plant 

nucleus, the T-DNA is integrated at a random position into the plant genome 
(reviewed by (Hooykaas and Beijersbergen, 1994), (Hansen and Chilton 1999). 
Removal of the oTic-genes from the T-DNA does not inactivate T-DNA transfer. 
T-DNA, disarmed in this way, is now the preferred vector for the genetic 
25 modification of plants. 

Although much is known about the transformation process, not much is 
known about the process by which the T-DNA is integrated into the plant 
genome. It is likely that plant enzymes mediate this step of the transformation 
process (Bundock at al. 1995). The integration pattern of T-DNA in 
30 transformed plants has been extensively studied (Matsumoto at al. 1990) 
(Gheysen et al 1991) (Meyerhofer et al 1991). The resxilts indicated that T- 
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DNA integrates via illegitimate recombination (IR) (also called non- 
homologous recombination, both terms may be used interchangeable herein), a 
process which can join two DNA molecides that share little or no homology 
(here the T-DNA and plant target DNA). Even T-DNA molecules in which a 
5 large segment of homologous plant DNA was present, integrated mainly by IR 
and only with very low frequency (1:10^10^) by homologous recombination 
(HR) (Offringa et al. 1990). 

Recently, it was shown that Agrobacterium is not only able to transfer 
its T-DNA to plant cells, but also to other eukaryotes, including the yeast 

10 S.cerevisiae (Bundock et al. 1995) and a wide variety of filamentous fungi (de 
Groot et al. 1998). In S.cerevisiae^ T-DNA carrying homology with the yeast 
genome integrates via HR (Bundock et al. 1995). However, T-DNA lacking any 
homology with the S.cerevisiae genome becomes integrated at random 
positions in the genome by the same IR process as is used in plants (Bxmdock 

15 and Hooykaas 1996). Apparently, etdcaryotic cells have at least two separate 
pathways (one via homologous and one via non-homologous recombination) 
through which nucleic acids (in particiilar of coixrse DNA), can be integrated 
into the host genome. The site of integration into a host ceU genome is 
important with respect to the likelihood of transcription and/or expression of 

20 the integrated nucleic acid. The present invention provides methods and 
means to direct nucleic acid integration to a pre-determined site through 
steering integration towards the homologous recombination pathway. The 
present invention arrives at such steering either by enhancing the HR 
pathway, or by inhibiting (meaning reducing) the IR pathway. 

25 Host factors involved in the integration of nucleic acid by IR have so far 

not been identified. The present invention discloses such factors which enables 
the design of methods for their (temporary) inhibition, so that integration of 
nucleic acid by IR is prevented or more preferably completely inhibited, 
shifting the integration process towards HR and facilitating the isolation of a 

30 host cell with nucleic acid integrated by HR at a predetermined site. This is 
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extremely important, since there is no method available yet for easy and 
precise genetic modification of a host cell using HR (gene targeting). Of course 
the actual site of integration is then determined by homology of the nucleic 
acid of interest with said site. 
5 In a first embodiment the invention provides a method to direct nucleic 

acid integration of a nucleic acid of interest to a pre-determined site, whereby 
said nucleic acid has homology at or around the said pre-determined site, in a 
eukaryote with a preference for non-homologous recombination comprising 
steering an integration pathway towards homologous recombination. 

10 Preferably, such a method comprises at least the steps of introducing said 

nucleic acid of interest to a cell of said eukaryote, for example by the process of 
transformation or electr op oration, and integration of said nucleic acid in the 
genetic material of said cell. Integration is a complex process wherein a nucleic 
acid sequence becomes part of the genetic mateiial of a host cell. One step in 

15 the process of nucleic acid integration is recombination; via recombination 
nucleic acid sequences are exchanged or inserted and the introduced nucleic 
acid becomes part of the genetic material of a host cell. In principle two 
different ways of recombination are possible: homologous and illegitimate or 
non-homologous recombination. Most (higher) evikaryotes do not or at least not 

20 significantly practise homologous recombination although the essential 
proteins to accompHsh such a process are available. One reason for this 
phenomenon is that frequent use of homologous recombination in (higher) 
eukaryotes could lead to undesirable chromosomal rearrangements due to the 
presence of repetitive nucleic acid sequences. To accomphsh homologous 

25 recombination via a method according to the invention, it is important to 
provide a nucleic acid which has homology with a pre-determined site. It is 
clear to a person skilled in the art that the percentage of homology and the 
length of (a) homologous region(s) play(s) an important role in the process of 
homologous recombination. The percentage of homology is preferably close to 

30 100%. A person skilled in the art is aware of the fact that lower percentage of 
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homology are also used in the field of homologous recombination, but 
dependent on, for example, the regions of homology and their overall 
distribution, can lead to a lower efficiency of homologous recombination but 
are still useful and therefore included in the present invention. Furthermore, 
5 the length of a (nearly) homologous region is approximately 3 kb which is 

sufficient to direct homologous recombination. At least one homologous region 
is necessary for recombination but more preferably 2 homologous regions 
flanking the nucleic acid of interest are used for targeted integration. The 
researcher skilled in the art knows how to select the proper percentage of 

10 homology, the length of homology and the amount of homologous regions. By 
providing such a homology a nucleic acid is integrated at every desired 
position within the genetic material of a host cell. It is clear to a person skilled 
in the art that the invention as disclosed herein is used to direct any nucleic 
acid (preferably DNA) to any pre-determined site as long as the length of 

15 homology and percentage of homology are high enough to provide/enable 

homologous recombination, A pre-determined site is herein defined as a site 
within the genetic material contained by a host cell to which a nucleic acid 
with homology to this same site is integrated with a method according to the 
invention. It was not until the present invention that a nucleic acid is 

20 integrated at every desired position and therefore a method according to the 
invention is applied, for example, to affect the gene function in various ways, 
not only for complete inactivation but also to mediate changes in the 
expression level or in the regulation of expression, changes in protein activity 
or the subcellular targeting of an encoded protein. Complete inactivation, 

25 which can usually not be accomphshed by existing methods such as antisense 
technology or RNAi technology (Zrenner et al, 1993), is useful for instance for 
the inactivation of genes controlling undesired side branches of metabolic 
pathways, for instance to increase the quality of bulk products such as starch, 
or to increase the production of specific secondary metabohtes or to inhibit 

30 formation of imwanted metabolites. A method according to the invention is 
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also used to inactivate genes controlling senescence in fruits and flowers or 
that determine flower pigments. Replacement of existing regxilatory sequences 
by alternative regulatory sequences is used to alter expression of in situ 
modified genes to meet requirements, (e.g. expression in response to particular 
5 physical conditions such as light, drought or pathogen infection, or in response 
to chemical inducers), or depending on the developmental state (e.g. in a 
storage organ, or in fruits or seeds) or on tissue or cell types. Also a method 
according to the invention is used to effectuate predictable expression of 
transgenes encoding novel products, for example by replacing existing coding 

10 sequences of genes giving a desired expression profile by those for a desired 

novel product. For example to produce proteins of medicinal or industrial value 
in the seeds of plants the coding sequence of a strongly expressed storage 
protein may be replaced by that of the desired protein. As another example 
existing coding sequences are modified so that the protein encoded has 

15 optimized characteristics for instance to make a plant herbicide tolerant, to 
produce storage proteins with enhanced nutritional value, or to target a 
protein of interest to an organelle or to secrete it to the extracellialar space. As 
yet another example eukaryotic cells (including yeast, fungus, plant, 
mammalian cells or (non-human) animal cells) are provided with a gene 

20 encoding a protein of interest integrated into the genome at a site which 

ensures high expression levels. As another example the nucleic acid of interest 
can be part of a gene delivery vehicle to dehver a gene of interest to a 
eukaryotic cell in vitro or in vivo. In this way a defect p53 can be replaced by 
an intact p53. In this way a tumoricidal gene is dehvered to a pre-determined 

25 site present only in e.g. proliferating cells, or present only in txunor cells, e.g. 
to the site where a tumor antigen is expressed from. Gene dehvery vehicles are 
well known in the art and include adenoviral vehicles, retroviral vehicles, non- 
viral vehicles such as hposomes, etc. As another example the invention is used 
to produce transgenic organisms. Knock-out transgenics are already produced 

30 by homologous recombination methods. The present invention improves the 
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ef&ciency of such methods. Also transgenics with desired properties are made. 
It is clear to a person skilled in the art that transgenics can for example be 
made by the use of Agrobacterium as gene delivery vehicle for plant (Vergxmst 
et al., 1998), yeast (Bundock et aL, 1995), fimgus (de Groot et al., 1998) or 
5 animal (Kunik et al., 2001) or by direct DNA delivery methods exemplified by 
but not restricted to electroporation for yeast (Gietz & Woods, 2001), plant 
(D'HaUuin et al., 1992; Lin et al., 1997), fungus (Ozeki et al, 1994) and animal 
(Templeton et al., 1997), LiCl treatment for yeast (Schiestl et al,, 1993), micro- 
injection for plant (Schnorf et ad., 1991) and animal (Capecchi, 1980) and 

10 "DNA whiskers" for plant (Kaeppler et al., 1990; Dunwell, 1999) or particle 
bombardment for plants and animals (Elein et aL, 1992). It is furthermore 
clear that transgenic plants can be obtained via selective regeneration of 
transformed plant cells into a complete fertile plant (Vergunst et al., 1998) or 
via non-regenerative approaches by transforming germ line cells, exemplified 

15 by but not restricted to dipping Arabidopsis flowers into an Agrobacterium 

suspension (Bechtold et al., 1993). It is also clear that transgenic animals can 
be obtained by transforming embryonic stem cells with one of the DNA 
delivery methods mentioned above (Hooper, 1992). 

In another embodiment the invention provides a method to direct 

20 nucleic acid integration to a pre-determined site, whereby said nucleic acid has 
homology at or around the said pre-determined site, in a eukaryote with a 
preference for non-homologous recombination comprising steering an 
integration pathway towards homologous recombination by providing a 
mutant of a component involved in non-homologous recombination. Methods to 

25 identify components involved in non-homologous recombination are outhned in 
the present description wherein S.cerevisiae was used as a model system. To 
this end several yeast derivatives defective for genes known to be involved in 
various recombination processes were constructed and the effect of the 
mutations on T-DNA integration by either HR or IR was tested. The resiilts as 

30 disclosed herein show that the proteins encoded by YKU70, RAD50, MREll, 
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XRS2, LIG4 and SIR4 play an essential role in DNA integration by IR but not 
by HR. WO 00/12716 describes a maize Ku70 orthologue and suggests that 
"Control of homologous recombination or non-homologous end joining by 
modulating Ku provides the means to modulate the efficiency which 
5 heterologous nucleic acids are incorporated into the genomes of a target plant 
cell." WO 00/68404 describes a maize RadSO orthologue and suggest an 
analogous control for Rad50. Both patent applications do however not disclose, 
in contrast to the present patent apphcation, that by preventing or more 
preferably completely inhibiting non-homologous recombination, for example 

10 by providing a mutant of a component involved in non-homologous 

recombination or by inhibiting such a component, that the integration 
pathway is steered towards homologous recombination. It is clear to a person 
skilled in the art that different mutants of a component involved in non- 
homologous recombination exist. Examples are deletion mutants, knock-out 

15 (for example via insertion) mutants or point mutants. Irrespective of the kind 
of mutant it is important that a component involved in non-homologous 
recombination is no longer capable or at least significantly less capable to 
perform it's function in the process of non-homologous recombination. As 
disclosed herein disruption of YKU70, RAD SO, MREll, XRS2, LIG4 and SIR4 

20 did not affect the jfrequency of DNA integration by HR, showing that these 

genes are not involved in DNA integration by HR, but only in DNA integration 
by IR. More over, in the wild-type yeast strain 85% of the integration events 
occurred by HR (37% by replacement and 63% by insertion) and 15% by IR. In 
contrast, integration occurred only by HR in yeast strains lacking ku 70 or lig4. 

25 In radSO and xrs2 mutant strains the T-DNA preferentially integrated by HR 
(92%) and 93% of these T-DNAs integrated by replacement and only 7% by 
insertion. Thus, the absence of a functional radSO or ocrs2 gene leads to a 
significantly increased firequency of replacement reactions- 

In another embodiment the invention provides a method to direct 

30 integration of a nucleic acid of interest to a sub-telomeric andyor telomeric 
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region in a eukaryote with a preference for non-homologous recombination by 
providing a mutant of a component involved in non-homologous recombination. 
A telomeric region is typically defined as a region containing repetitive 
sequences which is located at the end of a chromosome. A sub-telomeric region 
5 is t3rpically defined as a region flanking the telomeric region. As an example it 
is disclosed herein that in yeast strains carrying disruptions of RAD50, 
MREll or XRS2 the distribution of integrated DNA copies is altered when 
compared to wildt3^e. DNA becomes preferentially integrated in telomeres or 
sub telomeric regions in the radSO, mrell and xrs2 mutants. A great advantage 

10 of integration of DNA copies in telomeres or subtelomeric regions instead of 
integration elsewhere in the genomic material is that there is no danger for 
host genes being mutated or inactivated by a DNA insertion. When in plants 
deficient for RAD50, MREll or XRS2 DNA copies also integrate into telomeres 
or subtelomeric regions, such plants are used for (sub)telomeric targeting of T- 

15 DNA in transformation experiments to prevent additional insertion mutations 
from random T-DNA integration into the plant genome. 

In yet another embodiment the invention provides a method to 
direct nucleic acid integration to a pre -determined site, whereby said nucleic 
acid has homology at or around the said pre-determined site, in a eukaryote 

20 with a preference for non-homologous recombination comprising steering an 
integration pathway towards homologous recombination by partially or more 
preferably completely inhibiting a component involved in non-homologous 
recombination. Partial or complete inhibition of a component involved in non- 
homologous recombination is obtained by different methods, for example by an 

25 antibody directed against such a component or a chemical inhibitor or a 
protein inhibitor or peptide inhibitor or an antisense molecixle or an RNAi 
molecule. Irrespective of the kind of (partial or more preferably complete) 
inhibition it is important that a component involved in non-homologous 
recombination is no longer capable or at least significantly less capable to 

30 perform it's function in the process of non-homologous recombination. In yet 
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another embodiment the invention provides a method to direct integration of a 
nucleic acid of interest to a sub-telomeric and/or telomeric region in a 
eukaryote with a preference for non-homologous recombination by partially or 
more preferably completely inhibiting a component involved in non- 
5 homologous recombination. Preferably, said component involved in non- 
homologous recombination is radSOy mrell or xrs2. 

In a preferred embodiment the invention provides a method to direct 
nucleic acid integration to a pre-determined site or to a sub-telomeric and/or 
telomeric region by providing a mutant of a component involved in non- 
10 homologous recombination or by partially or more preferably completely 

inhibiting a component involved in non-homologous recombination wherein 
said component comprises ku70, radSO^ mrelly xrs2y lig4y sir4 or others such 
as ku80 (Tacciole et al., 1994; Milne et al., 1996), lifl (Teo and Jackson, 2000; 
XRCC4 in human, see figure 6; Jimop et aL, 2000) and nejly (Kegel et al., 
15 2001; Valencia et al., 2001). Components involved in non-homologous 
recombination are identified as outlined in the present description. The 
nomenclature for genes as used above is specific for yeast. Because the 
nomenclature of genes differs between organisms a functional equivalent or a 
functional homologue (for example NBSl, a human 3crs2 equivalent (Paull and 
20 Gellert, 1999) and see for example figure 2 to 5) and/or a functional firagment 
thereof, all defined herein as being capable of performing (in function, not in 
amount) at least one function of the yeast genes ku70, radSO^ mrelly xrsS, lig4, 
sir4y kuSOy lifl or nejl are also included in the present invention. A mutant of 
a component directly associating with a component involved in non- 
25 homologous recombination or (partial or complete) inhibition of a component 
directly associating with a component involved in non-homologous 
recombination is also part of this invention. Such a component directly 
associating with a component involved in non-homologous recombination is, 
for example, identified in a yeast two hybrid screening. An example of a 
30 component directly associating with a component involved in non-homologous 
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recombination is ICU80, which forms a complex with KU70, In a more 
preferred embodiment the invention provides a method to direct nucleic acid 
integration in yeast, fungus, plant or (non-human) animal (cells). 

In another embodiment the invention provides a method to direct 
5 nucleic acid integration to a pre -determined site, whereby said nucleic acid has 
homology at or around the said pre-determined site, in a eukaryote with a 
preference for non-homologous recombination comprising steering an 
integration pathway towards homologous recombination by transiently 
(partially or more preferably completely) inhibiting integration via non- 
10 homologous recombination. In yet another embodiment the invention provides 
a method to direct integration of a nucleic acid of interest to a subtelomeric 
and/or telomeric region in a eiikaryote with a preference for non-homologous 
recombination by transiently (partially or more preferably completely) 
inhibiting integration via non-homologous recombination. In a more preferred 
15 embodiment, such a method is used for yeast, plant, fungus or (non-himian) 
animal and the transient (partial or more preferably complete) inhibition is 
provided by a (preferably stably) inserted and expressed chimeric transgene 
that encodes a peptide inhibitory to one, some or all non-homologous 
recombination (NHR) enzymes fused to a nuclear localisation signal (Hanover, 
20 1992; Raikhel, 1992) and the steroid-binding domain of a steroid receptor 
(Picard et aL, 1988). The chimeric transgene is constructed in such, a way, 
using either heterologous or non-heterologous promoter sequences and other 
expression signals, that it provides (stable) expression in the target cells or 
tissue for transformation. In the absence of the steroid hormone, the steroid- 
25 binding domain binds to chaperone proteins, and thereby the fusion protein is 
retained in the cjrtoplasm. Upon treatment with the steroid hormone, the 
chaperones are released from the steroid-binding domain and the inhibitory 
peptide will enter the nucleus where it will interact with and inhibit the action 
of NHR enzymes. An example of an inhibitory peptide is a KU80 fragment that 
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imparts radiosensitivity to Chinese hamster ovary cells (Marangoni at al., 
2000). 

In a more preferred embodiment such a method is used for yeast, 
plant, fungus or a (non-human) animal and the transient (partial or more 
5 preferable complete) inhibition is provided by an Agrobacterium Vir-fusion 
protein capable of (partially or more preferably completely) inhibiting a 
component involved in non-homologous recombination or capable of (partially 
or more preferably completely) inhibiting a functional equivalent or homologue 

•I* 

thereof or capable of (partially or more preferably completely) inhibiting a 
10 component directly associating with a component involved in non-homologous 
recombination. In an even more preferred embodiment such an Agrobacterium 
Vir fusion protein comprises VirF or VirE2. It was shown that the 
Agrobacterium VirF and VirE2 proteins are directly transferred from 
Agrobacterium to plant cells during plant transformation (V ergunst et al. 
15 2000). To, for example, accomplish T-DNA integration by HR in plants, VirF 
fusion proteins contaiiiing for example a peptide inhibitor of IR in plant cells 
are introduced concomitantly with the targeting T-DNA. It has been reported 
that the C-terminal part (approximately 40 amino acids) of VirF or VirE2 is 
sufficient to accomplish transfer of T-DNA. A functional fragment and/or a 
20 fimctional equivalent of VirF or VirE is therefore also included in the present 
invention. Preferably, said nucleic acid of interest is delivered to a cell of said 
eukaryote by Agrobacterium. 

In an even more preferred embodiment a component involved in non- 
homologous recombination comprises feu 70, radSO, mrell, xrs2, lig4, sir 4, 
25 ku80, lifl or nejl or fimctional equivalents or homologous thereof or 

associating components. The nomenclature for genes as used above is specific 
for yeast. Because the nomenclature of genes differs between organisms a 
functional equivalent or a functional homologue (see for example figure 2 to 5) 
and/or a functional fragment thereof, all defined herein as being capable of 
30 performing (in function, not in amount) at least one function of the yeast genes 
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ku70y radSO, mrell, ocrs2, lig4y sir4, ku80, lifl or nejlare also included in the 
present invention. By transiently (partially or more preferably completely) 
inhibiting a component involved in non-homologous recombination a nucleic 
acid is integrated at any desired position without permanently modifying a 
5 component involved in non-homologous recombination and preventing 

unwanted side effects caused by the permanent presence of such a modified 
component involved in non-homologous recombination. 

Methods according to the present invention, as extensively but not 
hmiting discussed above, are used in a wide variety of applications. One 
10 embodiment of the present invention is the replacement of an active gene by 
an inactive gene according to a method of the invention. Complete 
inactivation, which can usually not be accompHshed by existing methods such 
as antisense technology or RNAi technology, is useful for instance for the 
inactivation of genes controlling undesired side branches of metabohc 
15 pathways, for instance to increase the quality of bulk products such as str: : 
or to increase the production of specific secondary metabolites or to inhibit 
formation of unwanted metabolites. Also to inactivate genes controlling 
senescence in firuits and flowers or that determine flower pigments. Another 
embodiment of the present invention is the replacement of an inactive gene by 
an active gene. One example is the replacement of a defect p53 by an intact 
p53. Many tvimors acquire a mutation in p53 during their development w hich 
renders it inactive and often correlates with a poor response to cancer therai.*y. 
By replacing the defect p53 by an intact p53, for example via gene therapy, 
conventional anti cancer therapy have better changes of succeeding. In yet 
another embodiment of the invention a therapeutic proteinaceous substance is 
integrated via a method of the invention. In this way a tumoricidal gene ise 
delivered to a pre-determined site present only in e.g! proliferating cells, or 
present only in tumor cells, e.g. to the site where a tumor antigen is expressed 
from. In yet another embodiment the invention provides a method to introduce 
a substance conferring resistance for an antibiotic substance to a cell according 
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30 
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to a method of the invention. Also a method according to the invention is used 
to confer a desired property to an eukaryotic cell. In a preferred embodiment a 
gene delivery vehicle is used to deliver a desired nucleic acid to a pre- 
determined site. Gene delivery vehicles are well known in the art and include 
5 adenoviral vehicles, retroviral vehicles, non-viral vehicles such as hposomes, 
etc.. In this way, a for example, tumoricidal gene can be delivered to a pre- 
determined site present only in e.g. proliferating cells, or present only in tumor 
cells, e.g. to the site where a tumor antigen is expressed from. 

Furthermore a method according to the invention is used to improve 

10 gene targeting efficiency. Such a method is used to improve for example the 
gene targeting efficiency in plants. In plants transgenes integrate randomly 
into the genome by IR (Mayerhof et al. 1991), (Gheysen et al. 1991). The 
efficiency of integration by HR is very low, even when large stretches of 
homology between the transgene and the genomic target site are present 

15 (Offringa et al. 1990). Therefore, the efficiency of gene targeting using HR is 
very low in plants. The results that are disclosed herein show how to improve 
the gene targeting efficiency in plants. From the fact that T-DNA integration 
by IR is strongly reduced in KU70, BAD 50, MREll, XRS2, LIG4 and SIR4 
deficient yeast strains and T-DNA integration by HR is not affected in such 

20 strains, T-DNA integration by HR is more easily obtained in plants, deficient 
for either of these genes. Recently, we have cloned a KU70 homologue of 
Arabidopsis thaliana (see figure 2, Bundock 2000, unpublished data). RAD 50, 
MREll and LIG4 homologues have already been found in A,thaliana 
(GenBank accession nimibers AF168748, AJ243822 and AF233527, 

25 respectively, see also figure 3, 4 and 5 (Hartung and Puchta 1999). Currently, 
screenings are being performed to find plants carrying a T-DNA inserted in 
AtMREll, AtKUTO or AtLIG4, These knockout plants are used to test whether 
T-DNA integration by IR is reduced and integration by HR is essentially 
unaffected, thereby facilitating the detection of T-DNA integration by HR. 
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Fixrthermore, the invention provides a method to direct integration of a 
nucleic acid of interest to a pre-determined site, whereby said nucleic acid has 
homology at or around the said pre-determined site, in a eukaryote with a 
preference for non-homologous recombination, comprising steering an 
5 integration pathway towards homologous recombination, wherein said nucleic 
acid sequence of interest is essentially replacing a sequence within said 
exikaryote. As disclosed herein within the experimental part, in the wild-type 
yeast strain 85% of the integration events occurred by HR (37% by 
replacement and 63% by insertion) and 15% by IR. In contrast, integration 
10 occurred only by HR in yeast strains lacking ku70 or lig4. In radSO and xrs2 
mutant strains the T-DNA preferentially integrated by HR (92%) and 93% of 
these T-DNAs integrated by replacement and only 7% by insertion. Thus, the 
absence of a functional radSO or ocrs2 gene leads to a significantly increased 
firequency of the desired replacement reactions. 

15 

The invention will be explained in more detail in the following 
description, which is not limiting the invention. 
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EXPERIMENTAL PART 
Yeast strains. 

The yeast strains that were used are listed in Table 1. Yeast mutants 
5 isogenic to the haploid YPH250 strain were constructed using the one-step 
disruption method (Rothstein 1991). A 1987 bp fragment from the YKU70 
locus was amplified by PGR using the primers hdflpl 5'- 
GGGATTGCTTTAAGGTAG-3' and hdflp2 5'-CAAATACCCTACCCTACC-3'. 
The PGR product was cloned into pT7Blue (Novagen) to obtain 

10 pTTBlue YZn/ZO. A 1177 bp EcoRV/HindLlI fragment from the YKU70 ORF 
was replaced by a 2033 bp Hin^ll/Smal LEU2 containing fragment from 
pJJ283 (Jones and Prakash 1990), to form pT7BhieYKU70::LEU2. In order to 
obtain YKU70 disruptants Leu+ colonies were selected after transformation of 
YPH250 with a 2884 bp NdeVSmal fragment from pT7BhieYKU70::LEU2. 

15 The Expand™ High Fidelity System (Boehringer Mannheim) was used . . . 
according to the supplied protocol to amplify a 3285 bp fragment from the 
LIG4 locus with primers 

dnl4pl 5'-CGTAAGATTGGGGGAGTATAG-3' and 

dnl4p2 5'-CGTTTGAAATGGGAGGAGAGG-3'. The PGR product was cloned 
20 into pGEMT (Promega), resulting in pGEMTL/G4. A 1326 bp BamHUXhol 
fragment from pJJ215 (Jones and Prakash 1990) containing the HISS gene 
was inserted into the BamSi and Xhol sites of pIG20R, resulting in 
pIC20RiJJS5. A 782 bp EcoRl fragment from the LIG4 ORF was replaced with 
a 1367 bp EcoRI HISS containing fragment from pIC20RjHIiS5 to construct 
25 pGEMTLIG4:;jHIS5. In order to obtain LIG4 disruptants His+ colonies were 
selected after transformation of YPH250 with a 3854 bp Ncol/Notl fragment 
from i>GEMTLIG4::HIS3. In order to obtain RAD50 disruptants YPH250 was 
transformed with a EcoRl/Bglll fragment from pNKY83 and Ura"*" colonies 
were selected (Alani et al 1989). A rad50::hisG strain was obtained by 
30 selecting Ura- colonies on selective medium containing 5-FOA. Similarly 
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RAD 51 disruptants were obtained were obtained after transformation of 
YPH250 with a RAD51::LEU2 Xbal/Pstl fragment from pDGl52 and selection 
of Leu+ colonies (Schiestl et al 1994). The TRPl marker in pSM21 (Schild et al. 
1983) was replaced with a BgllUXbal LEU2 containing fragment from pJJ283 
5 (Jones and Prakash, 1990). This resvdted in pSM21L£;t7^. Leu+ RAD52 
disruptant colonies were selected after transformation of YPH250 with the 
RAI>52::LEU2 Bamfll fragment from pSM21LEU2. Disruption constructs 
were transformed to YPH250 by the lithiimi acetate transformation method as 
described (Gietz et al, 1992; Schiestl et al. 1993). Disruption of YKU70, LIG4, 
10 RAD50, RAD51 and RAD52 was confirmed by PGR and Southern blot 
analysis. 
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Table 1: Yeast strains 



Strain 


Genotype 


Reference 


YPH250 


MATa, ura3-52, lys2-801, ade2-101. 


(SikorsM and 




trpl-Al, his3-A200, Ieu2-Al 


Hieter 1989) 


YPK250rad51 


MATa, ura3-52, lys2-801, ade2-101. 


This st^ldy 




trpl-Al, his3-A200, Ieu2-Al, 




» 


rad51::LEU2 




YFm50rad52 


MATa, ura3-52, lys2-801, ade2-101, 


This study 




trpl-Al, his3-A200, Ieu2-Al, 






rad52::LEU2 




YPB250yku70 


MATa, ura3-52, lys2-801, ade2-101. 


This study 




trpl-Al, his3-A200, Ieu2-Al, 






yku70::LEU2 




YPH250rad50 


MATa, ura3-52, lys2-801, ade2-101. 


This study 




trpl-Al, his3-A200, leu2-Al,rad50::hisG 




YPH250Zi^4 


MATa, ura3-52, lys2-801, ade2-101. 


This study 




trpl-Al, his3-A200, Ieu2-Al, lig4::HIS3 




JKM115 


Aho, Ahml::ADEl, MATa, Ahmr::ADEl, 


(Moore and 




adel, leu2-3,112, lys5, trpl::hisG, 


Haber 1996) 




ura3-52 




JKM129 


Aho, Ahml::ADEl, MATa, Ahmr::ADEl, 


(Moore and 




adel, leu2-3,112, lys5, trpl::hisG, 


Haber 1996) 




ura3-S2, xrs2::LEU2 




JKM138 


Aho, Ahml::ADEl, MATa, Ahmr::ADEl, 


(Moore and 




adel, leu2-3,112, lys5, trpl::hisG, 


Haber 1996) 




ura3-52, mrell::hisG 




YSL204 


Aho, HMLa, MATa, HMRa, adel- 100, 


(Lee et al. 




leu2-3,112, lysS, trpl::hisG, ura3-52. 


1999) 




hisG'-URA3-hisG', sir4::HIS3 
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Construction of binary vectors. 

To construct pSDMSOOO a 1513 bp Pvull/EcoRV fragment carrying the 
KanMX maxk^ex was obtained from pFA6a (Wach et al. 1994) and was ligated 
into the unique Hpal site of pSDM14 (OfEringa 1992), pSDMSOOl was made in 
5 three cloning steps. A 1476 bp BamUl/EcoRl fragment carrying the KanMX 
marker was obtained from pFA6a and ligated into BamHl and EcoBl digested 
pIC20H to form pIC20H/eon.MX. The KanMX marker was inserted between the 
PDAl flanks by replacement of a 2610 bp Bglll fragment from pUC4ElalO 
(Steensma et al. 1990) with a 1465 Bglll fragment from pIC20HA;a7iMX. A 
10 3721 bp Xhol/Kpnl fragment from this construct was inserted into the Xhol 
and Kpnl sites of pSDM14. The binary vectors pSDMSOOO and pSDMSOOl 
were introduced into Agrobacterium tumefaciens LBA1119 by electroporation 
(den Dulk-Ras and Hooykaas 1995). 

15 Cocultivations / T-DNA transfer experiments, 

Cocultivations were performed as described earlier with shght 
modifications (Bundock et al. 1995). Agrobacterium was grown overnight in LC 
medium. The mix of Agrobacterium and S. cerevisiae cells was incubated for 9 
days at 20°C. G41S resistant S.cerevisiae strains were selected at 30®C on 
20 YPAD medium containing geneticin (200 jtg/ml) (Life Technologies/Gibco 
BRL). 

Vectorette PGR. 

Chromosomal DNA was isolated using Qiagen's Genomic Tips G/20 per 
25 manufacters protocol. 1-2 jig of Genomic DNA was digested with EcoRl, Clal, 
PstI or Hindlll and run on a 1% TBE-gel. Non-radioactive Southern blotting 
was performed. The membrane was hybridized with a digoxigenine-labeled 
kanMX probe to determine the size of T-DNA/genomic DNA fragments (EcoRI 
and CZal for RB containing fragments and PstI and Hindlll for LB containing 
30 fragments). The kanMX probe, a 792 bp internal fragment of the KanMX 
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marker, was made by PGR using primers kamnxpl 5'- 
AGACTCACGTTTCGAGGCC-3' and kanmxp2 5'- 

TCACCGAGGCAGTTCCATAG-3' and a Non-Radioactive DNA Labeling and 
Detection kit (Boebringer Mannbeim). Tbe enzyme sbowing tbe smallest band 
5 on blot was used for Vectorette PGR, in order to amplify the smallest junction 
sequence of T-DNA and genomic DNA. Vectorette PGR was performed as 
described 

(bt.tp://genomewww.staiiford.edii/group/botlab/protocols/vectorette.html). The 
Expand™ High FideUty System (Boehringer Mannheim) was used to amplify 
10 fragments larger than 2.5 kb, whereas sTaq DNA polymerase (SphaeroQ) was 
used for amplification of fragments smaller than 2.5 kb. Primer kanmxpS 5'- 
TGGCAGGTGTGGAGCGAGGAGG-3' and kanmxp4 5'- 

TGGGGTGGACATCATGTGCGGAG-3' were used to amplify RB/genomic DNA 
and LB/genomic DNA jxmction sequences, respectively. 

15 

T7 DNA Polymerase sequencing. 

Vectorette PGR products were cloned in pGEMTEasy (Promega) and 
sequenced using the T7 polymerase sequencing kit (Pharmacia) according to 
manvtfacturers protocol. In order to obtain sequences flanking the RB and LB, 
20 primers kanmxpS 5'-TGAGATGATGGGGGTGAGGTGG-3' and kanmxp4 were 
used, respectively. 
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RESULTS 



1. Binary vectors for T-DNA transfer to yeast. 

It was previously demonstrated that Agrobacterium tumefaciens is able 
5 to transfer its T-DNA not only to plants but also to another exikaryote, namely 
the yeast Saccharomyces cerevisiae (Bundock et al. 1995). T-DNA carrying 
homology with the yeast genome was shown to become integrated by 
homologous recombination. T-DNA lacking any homology with the yeast 
genome was integrated randomly into the genome by IR like in plants 

10 (Bxmdock et al. 1995, (Bundock and Hooykaas 1996). The T-DNA used in these 
experiments carried the S.cerevisiae URA3 gene for selection of Ura+ colonies 
after T-DNA transfer to the haploid yeast strain RSY12(I7jRA5z1). However, in 
this system only yeast strains could be used in which the URA3 gene had been 
deleted to avoid homology between the incoming T-DNA and the S.cerevisiae 

15 genome. 

We wanted to setup a system in which T-DNA transfer to any yeast 
strain could be studied. Therefore, two new binary vectors were constructed 
using the dominant marker kanMX (W Sich et al 1994) which confers resistance 
against geneticin (G418). The T-DNA of pSDMSOOO carries only the KanMX 

20 marker. Since this -KaTiMX" marker consists of heterologous DNA, lacking any 
homology with the S. cerevisiae genome, we would expect this T-DNA to 
integrate by IR at a random position in the yeast genome. To be able to 
compare this with T-DNA integration by homologous recombination 
pSDMSOOl was constructed. The T-DNA of pSDMSOOl carries the KanMX 

25 marker flanked by sequences from the S. cerevisiae PDAl locus. The PDAl 

sequences have been shown to mediate the integration of T-DNA by HR at the 
PDAl locus on chromosome V (Bimdock et al. 1995). 

Cocultivations between A^ro6acierw77i strains carrying pSDMSOOO and 
pSDMSOOl, respectively, and the haploid yeast strains YPH250 and JKM115, 

30 respectively, were carried out as described in the experimental part. G418 
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resistant colonies were obtained at low frequencies for YPH250 (1.6 x lO '^) and 
JKM115 (1.2 X 10-5) after T-DNA transfer from pSDMSOOO (Table 2). T-DNA 
transfer from pSDMSOOl generated G418 resistant colonies at higher 
frequencies (2.4 x 10-^ for YPH250 and 1.8 x 10-4 JKM115, Table 2). The ratio 
5 of homologous recombination versus illegitimate recombination is determined 
by comparing the frequencies of G418 resistant colonies obtained from 
cocultivations using either pSDMSOOl or pSDMSOOO. This showed that a T- 
DNA from pSDMSOOl was 150-fold more likely to integrate than a T-DNA 
from pSDMSOOO in YPH250 (Table 2). A similar difference was previously seen 
10 using T-DNAs with the URA3 marker (Bundock and Hooykaas 1996). In 
contrast, T-DNA from pSDMSOOl was only 16-fold more likely to integrate 
than a T-DNA from pSDMSOOO in JKM115. There was no significant 
difference in the frequency of T-DNA transfer to these two yeast strains as was 
determined by T-DNA transfer experiments in which a T-DNA, that carried 
15 the Kan^MX marker and the yeast 2 micron replicon, was employed. Therefore, 
the differences in the frequencies of T-DNA integration by HR and IR between 
the yeast strains YPH250 and JKM115, respectively, is most likely contributed 
to differences in the capacities of their HR and IR recombination machineries. 
We confirmed by PGR that the T-DNA from pSDMSOOl became 
20 integrated at the PDAl locus by homologous recombination (data not shown). 
In order to find out whether the T-DNA from pSDMSOOO had integrated 
randomly by IR yeast target sites for integration were determined from 8 G418 
resistant YPH250 colonies by Vectorette PGR (for detailed description see 
materials and methods). Ghromosomal DNA was isolated and digested with a 
25 restriction enzyme that cuts within the T-DNA. A Vectorette was ligated to the 
digested DNA and a PGR was performed using a T-DNA specific and a 
Vectorette specific primer. The PGR product obtained was cloned into 
pGEMTEasy and sequenced using a T-DNA specific primer. The position of the 
T-DNA insertion was determined by basic BLAST search of the yeast genome 
30 (http://www-genome.stanford.edu/SGD). We were thus able to map the position 
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of the T-DNA insertions of all 8 G418 resistant colonies analyzed. They were 
present at different positions spread out over the genome. Comparison of the 
T-DNA sequence and yeast target site sequences did not reveal any obvious 
homology. These data show that the T-DNA from pSDMSOOO had integrated 
5 via an IR mechanism as expected. 

The following characteristics have previously been observed for T-DNAs 
integrated by IR: a) the 3' end of the T-DNA is usually less conserved 
compared to the 5' end, b) microhomology is sometimes present between the T- 
DNA ends and the target site, c) integration is often accompanied by small 

10 deletions of the target site DNA (Matsumotot et al. 1990), (Gheysen et al. 
1991, (Mayerhofer et al. 1991), (Bundock and Hooykaas 1996). Similar 
characteristics were seen in the currently analyzed 8 T-DNA insertions. In 3 
strains we observed microhomology of 2 — 6 bp between the LB and yeast 
target site (figxire 1, WT.51 was taken as an example). In 5 strains deletions of 

15 1 — 5 bp of yeast target site DNA was found and we observed deletions, varying 
from 1 — 112 bp, of the 3' end of the T-DNA in 7 out of 8 analyzed strains. In 
only 1 strain the 3' end appeared to be intact. The 5' end of the T-DNA was 
conserved in almost all strains. In only 2 strains we observed small deletions 
of 1 and 2 bp at the 5' end of the T-DNA. 

20 Thus, we can conclude that the T-DNA from pSDMSOOO had integrated 

via the same IR mechanism described before. 
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Table 2: Frequencies of T-DNA integration by IR relative to integration by HR 
in recombination defective yeast strains 



Strain 


Genotype 


Freq. of 


Freq. of 
HR 


Absolute 

IR/HR 

ratio"* 


Standardized 
IR/HR ratiQC 


YPH250 


WT 


1.6 X 10-7 


2.4 X 10-5 


0.007 


1 


YPH250 


radSlA 


1.4 X 10-7 


1.5 X 10-6 


0.09 


14 


rod 51 












YPH250 


rad52A 


3.8 X 10-7 


2.5 X 10-6 


0.15 


23 


rad52 












YPH250 


yku 70a 


<3.2 X 10-9 


3.3 X 10-5 


<0.0001 


<0.01 


yku70 












YPH250 


radSOA 


8.0 X 10-9 


3.5 X 10-5 


0.0002 


0.03 


rod 50 












ilrriAoO 




3.7 X 10^ 


z.o X 10 ° 


KJ.VXJKJZ 


O.Oil 


lig4 












JKM115 


WT 


1.2 X 10-5 


1.8 X 10--* 


0.07 


1 


JKM129 


ocrs2/i 


2.7 X 10-7 


5.1 X 10-5 


0.005 


0.08 


JKM138 


mrel 1a 


2.9 X 10-7 


7.5 X 10-5 


0.004 


0.06 


YSL204 


sir4A 


1.5 X 10-7 


1.8 X 10-5 


0.008 


0.13 



a Averages of 2 or more independant experiments are shown. Frequencies are 
5 depicted as the number of G418 resistant colonies devided by the output 
number of yeast cells (cells/ml). 

b The frequency of T-DNA integration by IR (pSDMSOOO) devided by the 
frequency of T-DNA integration by HR (pSDMSOOl). 

c The ratio of IR/HR in the mutant strain devided by the ratio of IR/HR in the 
10 wildtype strain. 



wo 02052026A2„L> 



wo 02/052026 PCT/NLOl/00936 

24 

2. Host-specific factors involved in random T-DNA integration. 

The observation that the T-DNA from pSDMSOOO integrates by IR into 
the yeast genome allowed us to use this system to study the effect of host 
factors on the process of integration* Many proteins involved in various forms 
5 of DNA recombination have been identified in yeast. In order to determine the 
roles of a representative set of these enzymes in T-DNA integration, we 
compared T-DNA transfer and integration in wildtype yeasts with that of 
strains carrying disruptions of the genes encoding several recombination 
proteins. The RAD51, RADS2, KU70, RAD50 and LIG4 genes were deleted 

10 from YPH250 using the one step disruption method. Yeast strains carrying 
deletions in MREll, XRS2 and SIR4 in the JKM115 backgroimd were kindly 
provided by Dr. J. Haber. The results of cocultivations with these yeast strains 
are shown in Table 2. 

In radSl and radS2 mutants, which are impaired in homologous 

15 recombination, the frequency of T-DNA integration by HR was 16- and 9-fold 
lower, respectively, than observed for the wildtype (Table 2). This showed that 
RAD 51 and RAD 52 play a role in T-DNA integration by homologous 
recombination. In the IR defective ku70, radSO, lig4, mrell, xrs2 and sir 4 
mutants the frequency of T-DNA integration by HR did not differ significantly 

20 from that observed for wildtype (Table 2). This slxowed that these genes do not 
play a role in T-DNA integration by homologous recombination. 

The frequency of T-DNA integration by IR in a rad51 mutant did not 
differ significantly fi'om that observed for wildtjrpe, whereas in a radS2 mutant 
the fi^equency was about 2-fold higher (Table 2). This showed that RAD 51 and 

25 RAD52 are not involved in T-DNA integration by IR. The product of the 
RAD52 gene may compete with IR-enzymes for the T-DNA and thereby 
inhibits integration by IR to some extent. Strikingly, in radSO, mrell, xrs2, 
lig4 and sir4 mutants the frequency of T-DNA integration by IR was reduced 
dramatically: 20- to more than 40-fold (Table 2). T-DNA integration by IR 

30 seemed to be completely abohshed in the ku70 mutant. We did not obtain any 
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G418 resistant colonies from several cocultivation experiments. This strongly 
suggests that KU70 plays an important role in random T-DNA integration in 
yeast. 

Since T-DNA integration by HR is normal in these mutants, these 
5 resxilts clearly show that the yeast genes KU70, RAD 50, MREll, XRS2, LIG4 
and SIR4 are involved in T-DNA integration by illegitimate recombination. 

3, Chromosomal distribution of integrated T-DNA copies in. IR 
defective S.cerevisiae. 

10 From several cocultivation experiments with the rad50, mrell, ocrs2, 

lig4 and sir 4 mutants we obtained a small number of G418 resistant colonies. 
The T-DNA structure was determined for a ntimber of these lines. To this end 
chromosomal DNA was isolated from these G418 resistant colonies and 
subjected to vectorette PGR to amplify junction sequences of genomic DNA and 

15 T-DNA. PGR products were cloned and sequenced. The yeast sequences linked 
to the T-DNA were used in a BLAST search at http://www- 
genome.stanford.edu/SGD to determine the T-DNA integration sites. 

Strikingly, analysis of LB/genomic DNA junctions revealed that in 2 out 
of 3 radSO, 4 out of 6 mrell and 2 ocrs2 strains analyzed, T-DNAs had 

20 integrated in telomeres or subtelomeric regions {rad50k.l, radSOk.S, mrellk.8, 
mrellk.ll, mrellk.l4, mrellk.17, xrs2k.l and xrs2k. 1 7; Table 3 and figure 1). 
S. cerevisiae telomeres generally consist of one or more copies of the element 
followed by telomerase-generated C(l-3)A/TG(l-3) repeats (Zakian 1996). In 2 
radSO strains, 2 mrell strains and 1 xrs2 strain the LB was found to be fused 

25 to this typical telomerase-generated C(l-3)A/TG(l-3) repeat (radSOk.l, 

rad50k.6, mrellk.l4, mrellk.l7 and xrs2k.l\ figure 1). Besides, we also found 
one T-DNA insertion in a Ty LTR element in the mrell mutant and 2 
insertions in the rDNA region, present in chromosome XII, in the mrell and 
radSO mutants (mrellk.5, mrellkA and rad50k.5, respectively; Table 3 and 

30 figxire 1). 
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The 3' end of the T-DNA was truncated in all strains. Deletions of 3 - 11 
bp of the 3'end of the T-DNA were observed (figure 1). Microhomology between 
the 3' end of the T-DNA and yeast target site was only found in 2 lines (5 bp in 
mrellk.4 and 4 bp in mrellk.14; figvire 1). For the T-DNA copies present at 
5 the yeast telomeres, the RB/genomic DNA junction sequences could not be 
obtained from these strains using vectorette PGR. This was only possible for 
the radSO and mrell strains carrjdng the T-DNA in the rDNA region on 
chromosome XII. In both strains the RB was intact and no homology between 
the 5' end of the T-DNA and the yeast target site was found (data not shown in 
10 figure 1). 

Previously, target sites for T-DNA integration in the genome of 

* 

S.cerevisiae strain RSY12 were determined (Bundock and Hooykaas 1996), 
(Bundock 1999. In 4 out of 44 strains analyzed, T-DNA copies were integrated 
in rDNA, Ty LTR elements (in 2 strains) and a subtelomeric located T 

15 element, respectively. In addition, we determined the position of T-DNA 

integration in ten S.cerevisiae YPH250 strains. We did not find any T-DNA 
insertions in rDNA, LTR elements or subtelomeric/telomeric regions amongst 
these ten. Pooling all insertions analyzed in wildtype (54), in 2 out of 54 
strains analyzed (4%) insertions were found in a Ty LTR element and in two 

20 other strains in the rDNA repeat (2%) and a subtelomeric region (2%), 

respectively. In contrast, we report here that T-DNA in yeast strains mutated 
in RAD50, MREll or XRS2 T-DNA integrates preferentially in (sub)telomeric 
regions (8 out of 11 lines: '-73%) of rad50, mrell and xrs2 mutants (table 3). 
From the remaining strains two T-DNAs were present in rDNA and one in a 

25 Ty LTR element, respectively. Apparently, the rDNA repeat is also a preferred 
integration site in these mutants ('-18% vs. -2% in the wildtype). 

Telomeres consist of a large array of telomerase-generated C(l- 
3)AyTG(l-3) repeats (~350 bp). In the subtelomeric regions two common classes 
of Y elements, 6.7 and 5.2 kb, can be fovmd (in most strains chromosome I does 

30 not contain Y) (Zakian and Blanton 1988), making the average size of these 
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regions '--6,0 kb. Thus, the yeast genome contains (16 x 2 x 0.35) 4- (15 x 2 x 
6,0) = 191 kb of subtelomeric/telomeric sequences. The yeast genome is 12,052 
kb in size, which means that only 1.6% of the genome consists of 
subtelomeric/telomeric sequences. In accordance with this, we observed in only 
5 2% of the wildtype strains T-DNA copies inserted in a subtelomeric region, 

which we would expect on the basis of random T-DNA integration. In contrast, 
in the radSO, mrell and xrs2 mutants 73% of the T-DNA insertions were found 
in the (sub)telomeric region. 

Analysis of 7 lines revealed that in the sir4 mutant T-DNA was 

10 integrated randomly into the yeast genome. So, although SIR4 has an effect on 
the efficiency of T-DNA integration by IR, the pattern of T-DNA distribution in 
the transformants seems similar as in the wildtype strain. In the sir4 mutant 
T-DNA integration by IR was characterized by truncation of the 3' end of the 
T-DNA, deletions at the target site and microhomology between the LB and 

15 the target site (data not shown), like this was observed for T-DNA integration 
by IR in the wildtype. 

These results clearly show that in the radSO, mrel 1 and xrs2 mutants 
the T-DNA, if integrated at aU, becomes preferentially inserted in telomeres or 
subtelomeric regions and that the genomic distribution of integrated T-DNAs 

20 is altered when compared to wildtype. However, disruption oiSIR4 did affect 
the efficiency of T-DNA integration by IR, but not the characteristics of this 
process. 
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Table 3: genomic distribution of T-DNA integrated by IR in rad50, mrell and 
xrs2 mutants in comparison with the wildtype after T-DNA transfer from 
pSDMSOOO 



Yeast strain 


(Sub)Telomeric 
region 


LTR 


rDNA 


Elsewhere 


radSO mutant 


2 


0 


1 


0 


mrell mutant 


4 


1 


1 


0 


xrs2 mutant 


2 


0 


0 


0 


wildtj^e 


1 


2 


1 


50 



5 
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DESCRIPTION OF FIGURES 



Figxxre 1: Jtmction sequences of T-DNA and S.cerevisae genomic DNA. 
S.cerevisiae YPH250 (WT), radSO, mrell and ocrs2 strains were cocultivated 
5 with LBA1119(pSDM8000). G418 resistant colonies were obtained. 

Chromosomal DNA was isolated and subjected to Vectorette PCR to determine 
the sequence of genomic DNA flanking the T-DNA. Position of T-DNA 
integration was determined by basic BLAST search of the yeast genome at 
http:/www.genome-stanford.edxi/SGD. The Watson strand of genomic DNA 
10 that is fused to the LB or RB is shown in italics. Bold sequences represent 

sequence homology between the LB and target site. The filler DNA sequence is 
underlined and depicted in italics. The numbers above the LB sequences 
represents the number of bp deleted from the LB. Tel. = telomeric, SubteL = 
subtelomeric and Int. = intergenic. 

15 

Figure 2: Alignment of KU70 homologues. Sc = Saccharomyces 
cerevisiae, Hs = Homo sapiens and At = Arabidopsis thaliana. Perfect identity 
is depicted as black boxes, homology is depicted as grey boxes and dashes were 
used to optimise alignment. 

20 

Figure 3: Alignment of LIG4 homologues. Sc = Saccharomyces cerevisiae, 
Hs = Homo sapiens and At = Arabidopsis thaliana. Perfect identity is depicted 
as black boxes, homology is depicted as grey boxes and dashes were used to 
optimise alignment. 

25 
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Figure 4: Alignment of MREll homologues. Sc = Saccharomyces 
cerevisiae, Hs = Homo sapiens and At = Arabidopsis thaliana. Perfect identity 
is depicted as black boxes, homology is depicted as grey boxes and dashes were 
used to optimise alignment. 

5 

Figure 5: Alignment of RAD50 homologues. Sc = Saccharomyces 
cerevisiae, Hs = Homo sapiens and At = Arabidopsis thaliana. Perfect identity 
is depicted as black boxes, homology is depicted as grey boxes and dashes were 
used to optimise alignment. 

10 

Figure 6: Alignment of XRCC4 homologues. Sc = Saccharomyces 
cerevisiae, Hs = Homo sapiens and At = Arabidopsis thaliana. 

15 
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Claims 

1. A method to direct integration of a nucleic acid of interest to a pre- 
determined site, whereby said nucleic acid has homology at or around the said 

5 pre-determined site, in a eukaryote with a preference for non-homologous 
recombination, comprising steering an integration pathway towards 
homologous recombination. 

2. A method to direct nucleic acid integration according to claim 1, 
comprising providing a mutant of a component involved in non-homologous 

10 recombination, 

3. A method to direct nucleic acid integration according to claim 1 or 2, 
comprising inhibiting a component involved in non-homologous recombination. 

4. A method according to claim 2 or 3 wherein said component involved 
in non-homologous recombination comprises ku70, radSO, mrell, xrs2, lig4 or 

15 sir 4. 

5. A method to direct integration of a nucleic acid of interest to a pre- 
determined site according to anyone of claims 1 to 3, wherein said nucleic acid 
sequence of interest is essentially replacing a sequence within said eukaryote. 

6. A method to direct integration of a nucleic acid of interest to a pre- 
20 determined site according to claim 5, wherein said component involved in 

non-homolgous recombination comprises radSO or ocrs2. 

7. A method to direct integration of a nucleic acid of interest to a sub- 
telomeric and/or telomeric region in a eukaryote with a preference for non- 
homologous recombination by providing a mutant of a component involved in 

25 non-homologous recombination. 

8. A method to direct integration of a nucleic acid of interest to a sub- 
telomeric and/or telomeric region in a eukaryote with a preference for non- 
homologous recombination, comprising inhibiting a component involved in 
non-homologous recombination 
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9. A method to direct integration according to claim 7 or 8 wherein said 
component involved in non-homologous recombination comprises radSO, mrell 
or xrs2. 

10. A method according to anyone of claims 1 to 9 wherein said 
5 eukaryote comprises yeast, a fungus or an animal. 

11. A method according to anyone of claims 1 to 10, wherein said nucleic 
acid of interest is delivered to a cell of said eukaryote by Agrobacterium, 

12. A method according to anyone of claims 1-11 comprising transiently 
inhibiting integration via non-homologous recombination. 

10 13. A method according claim 12 wherein said transiently inhibiting is 

provided by an Agrobacterium Vir-fusion protein capable of inhibiting a 
component involved in non-homologous recombination. 

14. A method to direct nucleic acid integration according to claim 13 
wherein said Agrobacterium Vir fusion protein comprises VirF or VirE2. 

15 14. A method according to claim 13 or 14 wherein said component 

involved in non-homologous recombination comprises ku70, radSO, mrell, 
xrs2, lig4 or sir4, 

15. A method according to anyone of the aforegoing claims wherein said 
nucleic acid of interest comprises an inactive gene to replace an active gene. 

20 16. A method according to anyone of claims 1-14, wherein said nucleic 

acid of interest comprises an active gene to replace an inactive gene. 

17. A method according to anyone of claims 1-14, wherein said nucleic 
acid of interest encodes a therapeutic proteinaceous substance. 

18. A method according to anyone of claims 1-14, wherein said nucleic 
25 acid of interest encodes a substance conferring resistance for an antibiotic 

substance to a cell. 

19. A method according to anyone of claims 1-14, wherein said nucleic 
acid of interest confers a desired property to said eukaryotic cell. 

20. A method according to anyone of the aforegoing claims wherein said 
30 nucleic acid of interest is part of a gene delivery vehicle. 
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21. Use of a method according to anyone of claims 1 to 20 fox 

improvement of gene targeting efficiency. 
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Fig. 1 



Strain 



LB' T-DNA RB' 
CAGGATATATTCAATTGTAAAT-CTC CGA-GG 



Chromosome, coordinate and location 



WT.51 



-4 



5' ATrGTAriMATATTCAATTGTAAAT-CTC CGA-GG Tfl 3' XIV, 185311 (1 bp of target 

site DNA deleted), int. region 



rad50k.1 

7Xid50k.5 

vad50k.6 



-6 

STGTGGGTGTGATATTCAATTGTAAAT-CTC— CGA-GG 3' 

-7 

5' GGGGGCarCAGTATTCAATTGTAAAT-CTC CGA-GG 3' 

-25 

5' gaggtagatgtgagagagtgtgtgtgggtgtgragt:cga 3' 



XV, 1091277, tel. region 
Xn. 465986, rDNA region 
XV, 1091276, teL region 



nirellk.4 
mrellk.5 

nirellk.8 
nxrellk.ll 
mrellkA4 
nirellk.l7 



-3 

5' TCTGGrAGATATATTCAATTGTAAAT-CTC- 

-8 

5' CACATATrrCrCATTCAATTGTAAAT-CTC- 

-11 

5' CGACrACrrrAT ArOCA ATTGTAAAT-CTC- 

-7 

5' GAAGAaCCCAITATTCAATTGTAAAT-CTC- 

-7 

5' rGGGrGrGGGTTATTCAATTGTAAAT-CTC- 

-9 

5' rGGGTGrGGrGrGTTCAATTGTAAAT-CTC- 



•-CGA-GG 3' 
-CGA-GG 3' 

—CGA-GG 3' 
-CGA-GG 3' 
-CGA-GG 3' 
-CGA-GG 3' 



xn, 459692/468829, rDNA region 

Vn/X/Xni, 536090 OE 541678/ 
472487 OR 483659/196667, LTR 

XIV, 6060. subteL region 

XIV, 4866, subteL region 

Vin, 562588, tel. region 

XII, 5727, subteL region 



xrs2k.l 



xrs2k.l7 



-10 

5' TGXGTGGGrGrGGGTCAATTGTAAAT-CTC CGA-GG 3' 

5' CGTCAAGGATATATTCAATTGTAAAT-CTC CGA-GG 3' 



IX/X, 69/52, tel. region 



xn, 1071797, subteL region 
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